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Abstract. We give the first analysis of a systematic scan version of the Metropolis algorithm. 
Our examples include generating random elements of a Coxeter group with probability determined 
by the length function. The analysis is based on interpreting Metropolis walks in terms of the 
multiplication in the Iwahori-Hecke algebra. 

1. Introduction 

When faced with a complex task, is it better to be systematic or proceed by making random 
adjustments? We study aspects of this problem in the context of generating random elements of a 
finite group. For example, suppose we want to fill n empty spaces with zeros and ones such that 
the probability of configuration x = (xi, . . . ,a;„) is ^"■"'^'(l — 0)1^1 with |x| the number of ones in 
X. A systematic scan approach works left to right filling each successive place with a 9-co'm toss. 
A random scan approach picks places at random and a given site may be hit many times before all 
sites are hit. The systematic approach takes order n steps and the random approach takes order 
julogn steps. 

Realistic versions of this toy problem arise in image analysis and Ising like simulations where 
one must generate a random array by a Monte Carlo Markov chain. Systematic updating and ran- 
dom updating are competing algorithms discussed in detail in Section 2. There are some successful 
analyses for random scan algorithms, but the intuitively appealing systematic scan algorithms have 
resisted analysis. 

Our main results show that the binary problem above is exceptional; for the examples analyzed 
in this paper, systematic and random scans converge in about the same number of steps. 
Let be a finite Coxeter group generated by simple reflections si, • • • 1 ■Snj 

where s? = id. 

For example, W may be the permutation group Sn.+i with Si = + 1). The length function £{w) 
is the smallest k such that w = s^^ ■ ■ ■ Si^ . Fix < ^ < 1 and define a probability distribution 
on W by 

a-l{w) 



TTiW 



where ^^(^"1) = ^ (1.1) 
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is the normalizing constant. Thus Tr{w) is smallest when f/; = id and, as 6* 1, vr tends to the 
uniform distribution. These non-uniform distributions arise in statistical work as Mallows models. 
Background and references are in Section 2e. 

A standard Monte Carlo Markov chain algorithm for sampling from tt is the Metropolis algo- 
rithm with a systematic scan. This algorithm cycles through the generators in order. If multiplying 
by the current generator increases length this multiplication is made. If the length decreases, then 
the multiplication is made with probability 9 and omitted with probability 1 — 9. One scan uses 
si,S2, • • . ,Sn-i,Sn,Sn,Sn-i,. • • in Order. Define 

K{w,w') = the chance that a systematic scan started at w ends in w' . (1-2) 

Repeated scans of the algorithm are defined by 

K^{w,w') = J2k^~\'^^w")K{w",w'), £>2. (1.3) 

w" 

In Section 2c and 4a we show that this Markov chain has tt as unique stationary distribution. 

The main results of this paper derive sharp results on rates of convergence for these walks. 
As an example of what our methods give, wc show that order n scans are necessary and suffice to 
reach stationary on the symmetric group starting from the identity. More precisely, we prove 

Theorem 1.4. Let Sn be the permutation group on n letters. Fix 9, < 9 < 1. Let 
Kf{w) = K^(id,w) be the systematic scan chain on Sn defined by (1.2) and (1.3). For £ = 
n/2 - (log n)/ (log 6*) + c with c> 0, 

\\Ki - neWl^ < (e^"^' - l) + n!r'/8-"(^°«")/(^°se)+"('=+i/4). (1.5) 

Conversely, if i < n/4 then, for fixed 9, \\KI — Tr\\j,y tends to 1 as ra ^ oo. 

The total variation norm above is defined in Section 2a below. Note that the upper bound 
in (1.5) tends to zero for c large, so that about n/2 scans suffice to reach stationarity. The lower 
bound shows this is of the right order for large n. 

Each scan above uses 2n multiplications. Thus Theorem 1.4 implies that the systematic scan 
approach reaches stationarity in operations up to lower order terms. We also conjecture that 
the random scan approach (see Section 2b) for this example takes order operations. Further, 
in Section 7, we prove that the scan based on the sequence 

(Si, S2, . . . , Sn, Sn,---, Si), (si, . . . , S„_i, S„_i, . . . , Si), . . . , (si, §2, §2, Si), (si. Si) 

converges in one pass. Thus, again, up to lower order terms, operations suffice to reach sta- 
tionarity. These results shows that various different scanning strategies take the same number of 
operations to reach stationarity. 

One novel aspect of present arguments is our use of the Iwahori-Hecke algebra H spanned by 
the symbols {Tyj}yj^w This is generated by Tj = Tg. , 1 < i <n with the relations 

' " \qT,^y, + {q-l)T^, if iisiw) < i(w). 

We have succeeded in giving an algebraic interpretation of the Markov chain K(w, w') as multipli- 
cation in the Iwahori-Hecke algebra H. Prom there, knowledge of the center of H (via a result of 
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Brieskorn-Saito and Deligne) allows us to explicitly diagonalize K{w,w'). Convergence bounds are 
given in terms of the eigenvalues and the generic degrees of representation theory. Then calculus 
leads to results hke Theorem 1.4. 

Section 2 collects together probabilistic background and tools. We explain Markov chains, the 
Metropolis algorithm, systematic scans, and relate the basic Metropolis chain to a natural walk 
on the chambers of a building. In Section 2c we develop properties of the measures vr. Some of 
these are new even for reflection groups of type A (the symmetric group). These properties will 
be applied to prove lower bounds for walks as in Theorem 1.4. 

Section 3 collects together representation theoretic background and tools and connects the 
representation theory to Markov chains. Section 4 connects Hecke algebras to the Metropolis 
algorithm and specializes the results from Section 3. A basic upper bound for convergence is 
derived by relating two inner products. 

Sections 5 and 6 derive results for the hypcrcube and the dihedral groups. Here we find that 
both the systematic and random scans converge in about the same number of steps - the differences 
are only in the lead term constants (which are functions of 9). 

Section 7 derives results for two different systematic scanning plans for the symmetric group. 
Though we do not have the space to treat further examples in this paper, it should be remarked 
that the methods of Section 7 should also produce analogous results for the Weyl groups of type 
Bji and the imprimitive complex reflection groups G(r, 1, n). The long and short systematic scans 
can be defined in a similar way and the representation theory goes through without problems (see 
[Hf], [Ra] and [AK]). The remaining necessary ingredient is an analogue of Lemma 7.2. 

Acknowledgement. This paper is dedicated to our friend Bill Fulton. We are also thankful to Ruth 
Lawrence for early efforts to help understand deformed random walks. 

2. Probabilistic background 

In this section we give background material for Markov chains, the Metropolis algorithm, and 
systematic scans. In Section 2d we interpret the basic walk as a walk on flags and the chambers 
of a building and in Section 2e we derive basic properties of the stationary distributions. 

2a. Markov chains 

Background for Markov chains may be found in any standard probability text e.g. Feller [Fe, 
Chapter XV]. For the quantitative theory developed here see Saloff-Coste [SC] and the references 
therein. 

Let X be a flnite set. A Markov chain on X is a matrix K = {K{x, y))^ y^-^ such that 

K(x,y) G [0,1] and ^i^(x,2/) = l. 

vex 

The set X is the state space and K{x, y) gives the probability of moving from x to y in one step. 
Powers of the matrix K give the probability of moving from x to y in more steps. For example, 

K\x,y) = Y,K{x,z)K{z,y) 

indicates that to move from x to y in two steps the chain must move to z and then from z to y. 
The chain is irreducible and aperiodic if there is an ^ > such that K^{x,y) > for all x,y e X. 
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The chain K is reversible if there is a stationary distribution tt:X^ [0, 1], tt{x) = 1, such 

xex 

that, for all x,y e X, 

Tr{x)K{x,y) = Tr{y)K{y,x). 

For irreducible aperiodic K, reversibility implies that, for each x e X, the real numbers K^{x,y) 
converge to 7r{y) as £ ^ co. 

The quantitative theory of Markov chains studies the speed of convergence. The total variation 
distance of K^{x, •) to vr is defined by 



A„ — TT -1/ = max 



^K\x,y)-TT{y) 
yeA 



Using the set A = {y G X | K^{x,y) > 7r(y)} it is easily shown that 

ll^^-^ii.v = ^Ei^'(^'y)-^(y)i- (2-1) 

Let LF'{'k) be the space of functions /: X M with the norm 

a5). = E/(^)5(^M^)- (2-2) 

The following lemma provides a relation between the total variation and the i^(7r) norms. This 
bound is the primary tool for studying rates of convergence of Markov chains. 

Lemma 2.3. Let f G L^iw). Then \\f\\l^ < l\\fM\l 

Proof. By the Cauchy-Schwartz inequality. 
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2b. Systematic scan algorithms 

Let TT be a probability distribution on a finite set X and lei Ki,K2, . . . , Kn be Markov chains 
on X each having stationary distribution vr. Then any product Ki^Ki^_^ ■ ■ ■ has stationary 
distribution vr and a choice of an infinite sequence {i^j^i gives a scanning strategy. A ran- 
dom choice of indices gives a random scanning stategy. If each Ki is reversible for vr, then 
KiK2 - ■ ■ Kn-iKnKnKn-i ■ ■ ■ K2K1 is an example of a reversible systematic scanning strategy 
(while Ki ■ ■ ■ Kn is not necessarily reversible) . 

In routine applications of the Metropolis algorithm to image analysis and Ising-like models the 
state space has coordinates. Randomized strategies choose a coordinate at random and attempt 
to change it. Systematic strategies cycle through the coordinates in various orders. Fishman [Fi] 
reviews the literature on scanning strategies and gives some practical comparison. The scheme 
underlying Theorem 1.4 is Fishman's Plan 3. 

There has been some rigorous work on rates of convergence for systematic scans in a related 
case: Gaussian distribution of coordinates with the stochastic updating done by the heat bath 
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algorithm (also known as Glauber dynamics or the Gibbs sampler). One fascinating study by 
Goodman and Sokal [GS] relates scanning strategies to standard approaches for solving large 
linear systems. They show that the systematic scan heat bath algorithm is a stochastic analog of 
the Gauss-Seidel algorithm. Moreover, they show how previous analyses of Gauss-Seidel give the 
eigenvalues of its stochastic counterpart. Amit [Aml-2] and Amit and Grcnander [AG] have pushed 
forward and carried out these ideas to give some comparison of systematic and randomized sweeps 
in the Gaussian case. Their approach uses the fact that the heat bath algorithm is a projection 
operator. In the Gaussian case the problem reduces to the computation of angles between subspaces 
of a Hilbert space. Baronne and Prigessi [BF] and Roberts and Sahk [RS] are related references. 

2c. The Metropolis algorithm 

The Metropolis algorithm gives a way of changing the stationary distribution of a given Markov 
chain into any distribution. It was invented by Metropolis et al [MR]. A clear description is in 

Hammcrslcy and Handscomb [HH] and a recent survey appears in [DS]. 

Let X be a finite set. Let P{x,y) = P{y,x) be a symmetric Markov matrix on X and let tt 
be a fixed probability distribution on X. Form a new chain by the following recipe: 



' P{x,y), 



M{x,y) 



P{x,y) 



7r(x) ■ 



if X 7^ y and 7r(j/) > 7r(a;), 
li X ^ y and 7r(y) < 7r(x), 



(2.4) 



P{x,x)+ ^(^,^)fi-44Y ifa; = y. 



7r(z)<7r(x) 



In words: 

Form the Metropolis chain from x by choosing y from P{x,y). If 7r(y) > tt{x) move to 
X. If 7r(y) < Tr{x) flip a coin with chance of heads 7r{y)/Tr{x). If the coin comes up heads 
move to y. In all other cases stay at x. 

As shown in the references above, the Metropolis chain is reversible with stationary distribution 

TT. It is of practical importance that the chain M can be run knowing tt only up to a normalizing 
constant. Irrcducibility and aperiodicity of M must be checked on a case by case basis. 

An example of interest is X = W, where is a finite real reflection group generated by simple 
reflections si,S2, ■ ■ ■ ,Sn- Let P{x, y) be the Markov chain given by 



P{x,y) 



1/n, if y = SiX for some i, 
0, otherwise. 



Here P{x,y) is the usual random walk based on a generating set. It has uniform stationary 
distribution. Fix 6*, < ^ < 1 and let vr be as in (1.1). The Metropolis construction gives the 
Markov chain 



M{x,y) = < 



1/n, 
9/n, 



{six)<£{x) 



Lo, 



if y = SiX and £{y) > i{x), 
ii y = SiX and £{y) < £{x), 
iiy = x, 
otherwise. 



(2.5) 



6 



PERSI DIACONIS AND ARUN RAM 



which has stationary distribution tt. In Section 4a we demonstrate that this is exactly the chain 
given by left multiplication by a uniformly chosen generator Ti in the Iwahori-Hecke algebra H with 
q = 9~^. Similarly, the systematic scan chain of Theorem 1.4 can be interpreted via multiplication 
in H. 

Despite its widespread use there has been very limited success in analyzing the time to sta- 
tionarity of the Metropolis algorithm. In the present paper we carry this out for the random scan 
Metropolis algorithm (2.5) on the hypercube (Section 5) and on the dihedral group (Section 6). 
Though we have not analyzed the random scan Metropolis algorithm on the symmetric group we 
conjecture that order steps arc necessary and sufficient to achieve stationarity. A survey of 
what is rigorously known appears in [DS]. 

Diaconis and Hanlon [DH] studied the example given by W = S'„, the symmetric group, (so 
c{w) = n — [# of cycles in lu]), with input chain 

{-^, if v = (i, j)x for some transposition (i, j), 
0, otherwise, 

and stationary distribution 7r{w) = z9'^^'^\ where z is a normalizing constant and c{w) is the 
minimum number of transpositions needed to sort w. They showed that all eigenvectors of the 
resulting Metropolis chain are given by the coefficients of Jack's symmetric functions (expanded in 
terms of the power sum symmetric functions) and they used the corresponding eigenvalues to give 
a complete analysis of the running time. 

Similar analyses were carried out in the Ph.D. theses of Belsley [Be2] and Silver [Si]. They 
worked in abelian groups with vr proportional to where I is the length function with respect 
to a natural set of generators. In several cases they found that the eigenfunctions were natural 
deformations of classical orthogonal polynomials. Ross and Xu [RX] studied the random scan 
Metropolis algorithm on the hypercube using its representation as a random walk on a hypergroup. 
It should be emphasized that for other choices of vr, or in non-group cases, careful analysis of rates 
of convergence for the Metropolis algorithm is completely open. 

2d. Some other interpretations of the walks 

We have presented Theorem 1.4 in an algorithmic context. Here we show how the walk (2.5) 
arises geometrically on the space of flags and as the natural nearest neighbor walk on the chambers 
of a building. The systematic scan walks have similar interpretations. 

Let ¥g be a finite field. A complete flag F = {0 = Fq C Fi C F2 C ■ ■ ■ C C F„ = V) 

is a nested increasing sequence of subspaces of an n-dimensional vector space V over ¥q with 
dim(Fj) = i. A natural random walk on complete flags may be performed as follows: 
Choose i, 1 < i < n — 1, uniformly. 

Replace Fi by a uniformly chosen subspace Fj with C Fj C Fj+i. 

This walk is symmetric, irreducible and aperiodic. It thus has the uniform distribution as its 
unique stationary measure. It is instructive to think of the "g = 1" case. Then a flag is a nested 
increasing chain {ii} C {zi,i2} C • • • C {ii,i2, . . . ,in} of elements of an n set or, equivalently, a 
permutation {ii,i2, . . . ,in)- In this case the walk is multiplication by random pairwise adjacent 
transpositions. 

The space of flags may also be identified as the chambers of a building of type A^-i and in 
this formulation the walk is described as follows: 

Prom a chamber C of the building choose one of the adjacent chambers uniformly at 
random and move there. 
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In the elegant, readable treatment of buildings by Brown [Bw], he explains that flag space may 
be represented as G/B with G = GLn(¥q) and B the subgroup of upper triangular matrices in 
GLn{¥q). Then two flags g^B and (72-B differ in the ith step as above if and only if giPi = g2Pi 
where Pj is the parabolic subgroup Pi = B\J BsiB (see [Bw, pp. 102-103]). Thus, if flags g\B 
and g2B are z-adjacent then g2 = gib or g2 = gibsib' with b,b' G B, and so the walk on G/B 
moves from gB to gg'B with g' uniformly chosen in B or BsiB. In this way, choosing an adjacent 
chamber of the building at random produces a S-invariant walk on G/B. Finally, the walk on flags 
gives rise to a natural walk on the double coset space B\G/B (described in more detail in Section 
3b). The double coset space is identifiable with the symmetric group S'„ and the induced Markov 
chain is given by (2.5) with 9 = 1/q. A similar story holds for the natural walk on any spherical 
building. 

2e. Properties of the stationary distribution 

Suppose that (X, d) is a finite metric space. A simple way of building probability models on 
X is to fix < < 1 and £ X and define 

7r(a;) = ^^, where q = 9'^ and P^{q) = ^ q'^^''''"'^ (2.7) 

is a normalizing constant. When g = 1 the distribution is uniform. 

Models of the form (2.7) were introduced by Mallows [Ma] for the study of permutations. He 
used the length function as a distance, i{x~^xo) = d(x, Xq), and estimated q and Xq to match data. 
Such Mallows models have had application and development for ranked and partially ranked data 
using a variety of metrics [D], [Cr], [FV], [Mar]. They have also been used for phylogenetic trees 
[BHV], classification trees [SB] and compositions [Det]. 

One problem in studying Mallows models is that the normalizing constant P^ (q) is uncom- 
putable in general. In such cases properties of tt can be studied by simulation using the Metropolis 
algorithm given in Section 2c. 

For the examples based on reflection groups the normalizing constants are known and further 
there is a simple algorithm available for exact generation from vr. These properties are collected 
together here. In each case the properties are illustrated for the permutation group; some of our 
results are new for the original Mallows model. Further, the properties of tt (particularly Property 
4 below) are used in proving the lower bounds in Theorem 1.4. 

Throughout this section we work with the model where the underlying space X = W is a 
flnite Coxeter group generated by simple reflections, xq is the identity element of W, and the 
length function is the distance on W. Thus the model is 

T,{w) = ^— where q = 9-^ and P^(g) = (2.8) 

is the Poincare polynomial of the group W. It is a classical theorem that the normalizing constant 
has a simple form: 

i=l 

for known integers dj, the degrees of W (see [Hu, Theorem 3.15]). For the symmetric group Sn+i, 
di = i + 1 for 1 < i < n. The Poincare polynomial P^ (q) will be used crucially in what follows. 

Property 1. tt{w) = 'k{w~^), since £{w) = £{w~^). 
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This invariance under inversion was first used by Mallows [Ma] to characterize Mallows models 
in a larger class of measures as follows: Suppose n objects are to be ranked by making pairwise 
comparisons. Suppose the true ranking isl<2<3<---<n and a subject ranks objects i and j 
correctly with probability . Let Q{w) be the chance that the comparisons lead to the permutation 
w given that they are all consistent. Of course, Q{w) depends on the (2) parameters pij. Mallows 
proved that if Q{w) = Q{w~^), then for some real numbers q and 0, Q{w) = zq^^"^^ (f)'^^'^^ with 
r{w) = ^ iw{i) and z a normalizing constant. He further showed that the two parameters q 
and were practically indistinguishable for large n and suggested setting (f) = 1, leading to the 
distribution Tr{w). 

Property 2. Let J C {1,2, ...,n} and let Wj be the subgroup of W generated by the the 
generators Si for i G J. The group Wj is a parabolic subgroup of W. Each coset of Wj in W 
contains a unique coset representative xj of minimal length [Hu, Prop. 1.10] and the probability 
of any such coset is computable via 

n{x,Wj)=q'^^^^^. (2.10) 



As an example suppose that W is the symmetric group Sn generated by si, S2, . . . , s„-i where 
Si = + 1). If J = {1,2, ... ,n — 2} then Wj is the subgroup of permutations which leave n 
fixed. The minimal length coset representatives Xj for the cosets of Wj in W have j in position n 
and the rest of the entries in order. Property 2 says that 

n{{weSr.\w{n)=j})=q-^-^^y (2.11) 

Similarly, if J = {2, 3, . . . , n — 1} Property 2 yields 

7ri{w e 5„ I wil)=j}) = Q'-'^^y (2-12) 

Similar formulas can be derived for the cases where J consists of the first j or last j elements of 
{1,2,. 

In combination with Property 1, (2.11) also provides a fomula for the probability of the set of 

permutations with j in the n**^ position and (2.12) gives the probability of the set of permutations 
with j in the first position. More generally, one can give formulas for the probability of the set of 
permutations which have 1, 2, . . . , j in any given relative position. 

Property 3. Let Ji D J2 D • • • D Jfc = be a sequence of subsets of {1, 2, . . . , n}. Then a 
sequential algorithm for generating w in W from tt is to choose, for each 1 < i < k—1, the minimal 

length coset representative of a coset ofWj._^_-^ in Wj., 1 < i < k — 1 and multiply these together. 
If Xi is a minimal length coset representative of a coset in WjjWj^^^^ choose Xi with probability 

As an example, suppose W is the symmetric group Sn generated by si, S2, • • • , ^n-i- If Ji 2 
J2 D . . . D Jn-i is given by = {i, i + 1, . • • , — 1} the algorithm can be realized as the following 
sequential procedure: Place symbols down sequentially beginning with 1. If symbols 1, 2, . . . , i — 1 
have been placed in some order, place i first with probability q^~^{l — q')/(l — 5*), second with 
probability q^~'^{\ — q)/{l — g*), . . ., i^^ with probability {1 — q)/{l — q^). Continuing until all n 
elements are placed gives an efficient method of choosing from tt. 
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An application of this is the following clever algorithm suggested by Pak [Pa] for generating 
a uniformly chosen element of GL„(Fg). Choose w ^ Sn with probability proportional to (7^^"'). 
Then form biwb2 with 61 and 62 uniformly chosen in the lower triangular matrices in GL„(Fg). 
This yields an efficient algorithm for uniform choice in GL„(Fg). With obvious modifications this 
procedure easily adapts to the other finite groups with a BN pair. 

Property 4. Consider a finite Coxeter group with probability distibution tt as given in (2.8). Let 
Z be the random variable given by Z{w) = £{w) for w G W. Then, with di as in (2.9), 

EAZ) = ^-±^, and Var.(Z) = -^-V-^i4^I• (2-13) 



Proof. The moment generating function of Z is 

It follows that Z is the sum of independent random variables Zi, . . . , Z^, where 



Then 



and 



dt^ l-q (l-g)2 l-qdi 

It follows that 

q d^q'^' 



Yav^iZi) 



(l-g)2 



di\2- 



We remark that for Coxeter groups of type A^, Bn, Dn under the probability distribution vr, 
i{w) has an approximately normal distribution with mean and variance as in (2.13). This follows 
from its representation as a sum of independent variables in the proof of Property 4. For details, 
see [D, Ch. 6C, Cor. 1-2]. 

3. Hecke algebras 

This section introduces Hecke algebras as bi-invariant functions on a group. We develop 
the needed Fourier analysis and then specialize to the Iwahori-Hecke algebras associated to finite 
Coxeter groups. 



3a. Algebras and Fourier analysis 
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Random walks are traditionally analyzed using Fourier analysis [D] . We find this possible in 
our examples and here explain the basic tools. 

An algebra H over C is (split) semisimple if it is isomorphic to a direct sum of matrix algebras. 
This means that there exists a finite index set W, and positive integers d\, \ e W, such that 

where Ma^ (C) is the algebra of dx x dx matrices with entries in C. Fix an isomorphism 

(f>:H — ^0Md^(C) and define e^^ = 4>~\E^t), XeW, 1 < S,T < dx, (3.1) 

A 

where Egrp is the matrix in M(i^{C) which has a 1 in the (5", T) entry of the Ath block and 
zeros everywhere else. The elements e^rp G H are a set of matrix units for H. 
The matrix units {e;^'^} form a basis of H and we write 

h=Yl E PsT{h)e^ST, (3.2) 

for h e H. The homomorphisms p^:H^ Md^{C) and the linear functionals Xh'- ^ ^ given by 

P\h) = (p^T(^))i<5,T<ci. x^(^) = Tr{p\h)) 

are the irreducible representations and the irreducible characters of H, respectively. The homo- 
morphisms depend on the choice of (f) but the irreducible characters ^o^- 

A trace on if is a linear functional t: H C such that t{hih2) = t{h2hi), for all /ii, /i2 £ H- 
Up to constant multiples there is a unique trace on Mrf^(C) and this implies that for any trace 
t: H ^ C on H there are unique tx & C, X G W, such that 

i= E *AX^ (3.3) 

The trace t is nondegenerate iitx^^ for all A G W. Define a symmetric bilinear form HxH — >■ 
C on if by 

{hi,h2) H = iihihi)-, for /ii, /i2 G if. 

The form {,) ^ is nondegenerate if and only if t is a nondegenerate trace. 

Let be a basis of H. The Fourier transform of /t = Ylfwew ^wTw at the representa- 

tion p is 

h{p) = h^PiTu,). (3.4) 

The Fourier inversion theorems describe the change of basis matrix between {T^} and {e^/j,} and 
recovers h from {h{p^)} -^^y^ . 

Theorem 3.5. (Fourier inversion and Plancherel) Let H be a semisimple algebra over C witii a 
nodegenerate trace t. Let {T^}wew be a basis for the algebra H. Let {T^j^jew be the dual basis 
with respect to that is {T^,Ty)^ = dyjy. Then, with notations as in (3.1-3.4), 



^ST — 



wew 
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and, for any h, hi,h2 G H, 

h^=Y^ *ATr(/i(p^)/(f;)), for heH, and (3.6) 

{hi, h2)^=Y^ txTr (h^{p^)h2{p^)) , for h^M ^ H. (3.7) 

xew 

Proof. Since t is nondegenerate, the equation t{e^j.) = "^^^^w (^st) — ^x^st implies that 

is the dual basis to {65^} with respect to {j)^- 

By (3.2), p''sT{a) = ^{a,e^s)H- Thus 

wew wew 

The equation (3.6) is 

xew xew 

and (3.7) is 

(/il,/i2)„ = t{hih2) = Y.^Xxl{hih2) = J]iAlV(/ll(p^)/l2(p^)). I 

A A 

3h. Coset chains and Hecke algebras 

Let G be a finite group, B a subgroup of G and let Q be a left S-invariant probability 
distribution on G. Right multiplication by random picks from Q induces a random walk on G, 

Xo = xo,Xi = xogi,X2 = XQgig2, (3.8) 

which, in turn, induces a process on B cosets, Yo,Yi,Y2, . . ., where Yi is the coset containing Xi. 
The chain on G produced by right multiphcation by random picks from Q is K{x,y) = Q{x~^y). 
The chance that this chain winds up in an element of yB is K{x, yB) = Q{x~^yB) and, since Q is 
left i?-invariant, K{x, yB) = K{xb, yB) for any b E B. This invariance is a necessary and sufficient 
condition for the induced coset process to be a Markov chain for any starting state xqB G G/B 
(see Theorem 6.32 of [KS]). If the support of Q is not a coset of a subgroup of G then the chain in 
(3.8) is irreducible and aperiodic with uniform stationary distribution. The resulting coset chain 
is 

K{xB,yB) = Q{x~^yB) with stationary distribution it[xB) = \B\/\G\. 

If the probability Q is S bi-invariant then the right process (3.8) on G induces a process on B 
double cosets by simply reporting which double coset the element X^ is in. The chance the G-chain 
moves from x to an element of ByB in one step is Q{x~^ByB) and since this only depends on the 
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double coset of x the induced process is a Markov chain on double cosets for any starting state 
BxqB. Letting W he & set of coset representatives for the double cosets of B in G, the chain is 
given by 

K{w,w') = K{w~^ Bw' B), with stationary distribution tt{w) = \BwB\/\G\, 

where we view the double coset chain as a Markov chain on the set W. 

The Hecke algebra of the pair {G, B) is the subalgebra of the group algebra of G consisting of 
the B bi-invariant functions on G, 

H = {f:G^C\ fig) = fihgh), for 5 G G, 61, 62 G B}. 

Background on Hecke algebras may be found in Curtis and Reiner [CR, §11D]. If H is commutative 
then (G, B) is called a Gelfand pair and there is a well developed probabilistic literature surveyed 
in [Lel-2] or [D, Ch. 3F]. 

Let W be a set of representatives of the double cosets in B\G/B. The functions 

form a basis of H, where 5bwB is the characteristic function of the double coset BwB. The 
natural anti-involution on G given hy g g~^ induces an anti-involution *:H ^ H on H given 
by Tyj I— >■ T^-i . The trivial representation of G restricts to the index representation of H given by 

\BwB\ 

p^{T^) = md(w), where ind(w) = ' ' . (3.9) 

\B\ 

An example to keep in mind is G = GL„(Fg) with B the subgroup of upper triangular matrices. 
Then W is the set of permutation matrices and ind(;i;) = q^^'^h 

Let L{G/B) = {/: G ^ C I f{g) = f{gh) for 5 G G, 6 G S}. The group G acts on the left of 
L{G / B) and H{G/B) acts on the right of L{G/B) by convolution. The raison d'etre for the Hecke 
algebra is that H = Fiudc {L{G / B)) and, as [G. H) bimodules, 

L{G/B) =^G^® H^, (3.10) 
\ew 

where A runs over an index set W of all the irreducible representations of H, G^ is an irreducible 
G-module and is an irreducible H module, see [CR, (11.25) (ii)]. Ccntralizcrs of the action of 
a finite group, in this case G acting on L{G/B), arc semisimple (our base field is C) and therefore 
theory of Section 3a applies to Hecke algebras. The trace of the action of H on L[G/B) is given 

The decomposition (3.10) yields 

t= Yl *AX^, where tx = dim(G^), (3.12) 

and Xh irreducible characters of H. Define an inner product on H by 

{hi,h2) H =t{hih2), hi,h2eH. 



The basis 



T - 



is the dual basis to {T.u,}^^^r (3.13) 

wew 



md{w 

with respect to {,)h, i.e. ((l/ind(f ))r^-i, T^);, = c^^wj for all v,w G W (see [CR, (11.30)(iii)]). 



Sc. Iwahori-Hecke algebras 
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The Hecke algebras associated to finite Chevalley groups G and their Borel subgroups B have 
a remarkable structure theory for their double cosets - they are indexed by the elements of a finite 
Coxeter group W . For example, in the case of the group G = GLn(^q) and its Borel subgroup 
B of upper triangular matrices, the group W is the symmetric group. There are many wonderful 
references for this material; Brown [Bw], Bourbaki [Bou, Ch. IV §2 Ex. 22-27]. Curtis and Reiner 
[CR, §67-68] or Carter [Ca, §10.8-10.11]. We develop what we need in this section and give the 
relation to probability theory. 

Let W he a finite Coxeter group generated by simple reflections si, . . . These define a 
length function with ^(id) = 0, i{si) = 1, and £{siw) = £{w) it 1 for each w G W, 1 < i < n. The 
Iwahori-Hecke algebra H corresponding to W is the vector space with basis {T^ | w G W} and 
multiplication given by 

rpT -J^".^'' if ^(SiU-) = + 1, 

^'-'^-\{q-l)T^+qT,^^, if £{s,w) = £{w) - 1. ^^"'^^ 

where = T^, for 1 < i < n. When w = Si, Tf = {q-\)Ti + q or, equivalently, {Ti-q){Ti + l) = 0. 

Let W be an index set for the irreducible representations of W . For each A G Fl^ let 
be the corresponding irreducible character of W and let dx = x^(l) be the dimension of this 
representation. The irreducible representations of the Iwahori-Hecke algebra H are in one-to-one 
correspondence with the irreducible representations of W in such a way that if is the character 
of the irreducible representation of H indexed by A G W then 

for all w G W, see [CR, (68.21)]. In particular, the dimension of the irreducible representation of 

H indexed by A is d\. 

Define a trace t: if ^ C on if by 

is the Poincare polynomial of W . Then t is the trace on H given by (3.11) and the generic degrees 
are the constants t\ defined by 

t=Y.t>^xl- (3.15) 

where x^, A G W , are the irreducible characters of H (see (3.2)). li {,) H x H ^ £, is the inner 
product on H given by (/ti, /t2)jj = ^(^1^2), for all ^1, G H, then 

{T^,Ty-i)^ = S^yq'^y^P^iq), for all x,yeW, (3.16) 

see [CR, (68.29)]. 

The "trivial" representation of the Iwahori-Hecke algebra H is the one-dimensional repre- 
sentation corresponding to the trivial representation of W. For w G W, 

p\T^) = xliTn,) = q'^'^K and vr = J] 
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is the corresponding minimal central idempotent of H (cf. (3.9)). Since ti = 1, 

t{hTT) = tix^{h), and T^vr = g^^^'V, (3.17) 

for alike H and w G W, see [CR, (68.23) and (68.28)]. 

Let tr be the trace of the regular representation of H, i.e. tr(/i) is the trace of the linear 
transformation obtained from the action of h on H by left multiplication. Then 

tr=J]dAX^ (3.18) 
xew 

where d\ are the dimensions of the irreducible representations of H (see [CR, (3.37)(iii)]). 

Both traces tr and t are important in our analysis of Metropolis walks (see, for example, the proof 
of Proposition 4.8). 

4. Metropolis walks and systematic scans 

This section brings together previous results in the form needed to prove our main theo- 
rems. Wc show that the various systematic scans arc precisely represented as multiplication in 
the Iwahori-Hecke algebra. Then representation theory yields tractable expressions for the norms 
involved. 



4a. Metropolis walks on W 

Let W he a finite Coxeter group generated by simple reflections Si,...,s„ and, for each 
1 < i < ra, let Pi{x,y) be the Markov chain on W given by 



'^^ \0, otherwise. 



Fix 6, < 9 < 1, and let tt be as in (1.1) Then the Metropolis construction produces the Markov 
chain 

1, if 2/ = SiX and £{y) > i{x), 

6, if y = SiX and £{y) < £{x), (4.1) 
1 — 0, if y = x. 



Ki{x,y) 



The chain Ki can be interpreted as follows: 

From w;, try to multiply by ,Sj. If this increases the length, carry out the multiplica- 
tion. If it decreases the length flip a ^-coin. If the coin comes out heads carry out the 
multiplication. If it comes up tails the chain stays at w. 

Of course, the chain based on a fixed value of i is not irreducible. However, any convex linear 
combination and any symmetric product of reversible Markov chains with a fixed stationary dis- 
tribution is reversible with the same stationary distribution. If W is the symmetric group then the 
following chains are reversible for tt: 



1 " 

EKi (random scan Metropolis) 



1=1 



K1K2 ■ ■ ■ KnKn ■ ■ ■ K2K1 (short systematic scan) i'^-'^) 



{K1K2 ■ ■ ■ KnKn ■ ■ ■ K2K1) ■ ■ ■ {KiK2K2Ki){KiKi) 



(long systematic scan) 
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Note that K1K2 ■ ■ ■ is an irreducible Markov chain with tt stationary. However, it is not 
reversible in general. 

The following Theorem (which follows directly from our setup) shows that many Markov chains 
on W can be obtained by left multiplication by elements of H on the basis {T^}. The remaining 
results in this subsection provide the necessary tools for studying the convergence of these chains 
by using the representation theory of the Iwahori-Hecke algebra H. Though we have chosen to 
focus here on the Iwahori-Hecke algebras related to finite reflection groups W, the results of this 
section hold in a general Hecke algebra context. 

Theorem 4.3. Let W be a finite Coxeter group and let H be the Iwahori-Hecke algebra with 
basis {Tyjj^^w as dehned in (3.14). Let 

q = e-\ fi = Ti/q, and = g'^^^'^T^, forweW. 

The Metropolis chain Ki in (4. 1 ) is the same as the matrix of left multiplication by Ti with respect 
to the basis {T^jwew of H: 

f.f ^(Tsiw, if£{siw)>e{w), .^ 

' - \ (1 _ 0)f^ + ^r,,^, if£{siw) < i{w). ^ • ' 



Identify functions /: X ^ M in L'^{'k) with elements of the Iwahori-Hecke algebra H via 

f=Y, f{^)T.. (4.5) 
xew 

The following Proposition shows that we can use the inner product (, )^ on (defined in Section 
3c) to compute norms in L^{7r). Coupled with Lemma 2.3 it gives bounds on rates of convergence 
in total variation distance. 

Proposition 4.6. Let W be a finite Coxeter group and let tt be as in (1.1). With the identification 
of L'^('k) and the Iwahori-Hecke algebra H given by (4.5), 

if /t^, 9/7^). = if, gin, for all f,g G L^tt). 
wJiere *:H^His the involutive anti-automorphism of H defined by = T^-i . 

Proof. Use the notation / = J] = XI fix)q~^^''^T^ = J] ^ = 

xew xew xew 

(2.2) and (1.1) give 

xew ^ ' xew xew 

Thus, by (3.16), 

{f/'^,9/'^)2=^fx9x{Tx,Ty-i)^= f^gy{T^,Ty-i)^ = {f,g*)j,. | 

xew x,yew 
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The following lemma shows that the inner product in L'^{tt), reversibility, and the involution 
*:H^H are simply related. 

Lemma 4.7. Let H be the Iwahori-Hecke algebra corresponding to a finite real reflection 

group W and let it be as in (1.1). Let K be a reversible Markov chain on W determined by 
left multiplication by an element of H (also denoted by K). The chain K operates on L^(n) by 
Kf{x) = K{x,y)f{y). Then the following are equivalent: 
yew 

(a) K is reversible with respect to n, 

(b) K is self adjoint with respect to (, 

(c) K = K* in the Iwahori-Hecke algebra H, 

where {,).^ is the norm on L'^('k) defined in (2.2) and *: H ^ H is the involutive anti-automorphism 
ofH defined byT*=T^-i. 

Proof. If K is reversible then 

{Kf,g),= J2 K{x,y)f{y)g{x)7r{x)= f{y)K{y,x)gix)7r{y) = {f,Kg),, 

x,yeX x,yeX 

and, conversely, if K is self adjoint then 

TT{x)K{x,y) = {6a:,K5y) = {K5^,5y) = 'K{y)K{y,x), 

where Sz denotes the delta function at z, given by dz(x) = dzx (Kronecker delta). So K is reversible 
if and only if K is self adjoint. 

If K is self adjoint with respect to {,)^ then, by Proposition 4.6, 

{Kf,g*), = {Kfl-K^gl-K), = {f /-K^Kg/-,), = {f.{KgY), = {f,g*K*),. 

Thus, for all weW, {K,T^)^ = {1,T^K*)^ = {T^,K*)^ = {K*,T^)^. So K = K* . | 

The following Proposition is a primary tool for studying rates of convergence of Markov chains 
on Iwahori-Hecke algebras. It bounds the L^{7r) norm of Lemma 2.3 in terms of characters of the 
Iwahori-Hecke algebra. In contrast with the way that random walks are often analyzed (see, for 
example [DS2]) the following Proposition also shows that the Markov chain given by K can be 
analyzed without knowing the eigenvalues of - it is only necessary to compute traces. 

Proposition 4.8. Let H be the Iwahori-Hecke algebra corresponding to a finite real reflection 

group W. Let K be a reversible Markov chain on W with stationary distribution it determined 
by left multiplication by an element of H (also denoted by K). Let denote the Markov chain 
started at x after I steps. Then 

(^) \\Ki/n -l\\l = t^xliT^-^K^'T^l 

(b) Y.,T{x)\\Ki/TT-l\\l = Y,d^xl{K''), 
xew A#i 

where are the irreducible characters, t\ the generic degrees (3.15), and d\ the dimensions of 
the irreducible representations of H. 
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Proof. Equation (3.17) says t{hTr) = iiX^(/t), and T^tt = tt, for all h e H and w G W. Thus, since 
is a probability, Proposition 4.6 gives 

1 = (irf A,l), = {K'%,n), = {K''n,f,-.7r), 
Then, by Proposition 4.6, 

Thus, by (3.12) and Lemma 4.7(c), 

" xew 

Now (a) follows since {Ki/7T - 1, K^/tt -I) = {Ki/TT, K^/tt) - 1. Part (b) follows similarly from 
the following calculation. Using the definition (2.2) of the norm on L'^iir), 

}^7r{x){Kj7r,Kj7r)^= ^ 7r{x) — = 7r{y) — 

xew x,yeW x,yeW 

yew Xew 
where tr is the trace of the regular representation of H given in (3.18). | 

4b. Systematic scans 

One case of Proposition 4.8 which can be analyzed for all finite Coxeter groups W is the case 
when the Markov chain is a (generalized) systematic scan. This is when K is given by left 
multiplication by the element T^^ in the Iwahori-Hecke algebra. In terms of the geometry of the 
Coxeter group this chain is the Metropolis walk on the chambers which tries to move a chamber 
to its opposite chamber and back again by successive reflections in the walls of chambers. Since 
each step is a Metropolis step the chance that the chamber returns to its original position after 
one pass is not 1, but depends on the parameter 9. In the case when W is the symmetric group 
this chain is the long systematic scan defined in (4.2). 

Let z be the sum of all the reflections in W. Then ^; is a central element (since it is a conjugacy 
class sum) in the group algebra of W and thus, by Schur's Lemma, z acts by a constant cx on the 
irreducible representation of W labeled by A G W. The following well known result shows that the 
element T^^, where wq is the longest element of W, is an Iwahori-Hecke algebra analogue of the 
element z. From the point of view provided by Theorem 4.3 the following Proposition determines 
the eigenvalues (with their multiplicities) of the systematic scan Metropolis chain K on W. 

Proposition 4.9. Let z be the sum of all the reflections in W and let wq be the longest element 
of W. The element T^^ is in the center of the Iwahori-Hecke algebra H and 

p\f^J = q^'-'^'^^hd, where cx = ^^, 
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is the irreducible representation of H indexed by X, is the irreducible character ofW labeled 
by X, and d\ = x^(l) is the dimension of this representation. 

Proof. This result is standard, see [Ra, (2.4) and (2.5)], so we only sketch the proof here. A 
result of Brieskorn-Saito [BS] and Deligne [De] says that T^^ is in the center of the corresponding 
braid group. Since the Iwahori-Hecke algebra H is & quotient of the group algebra of the braid 
group it fohows that T^^ is in the center of H. The constant by which it acts on the irreducible 
representation labeled by A can be checked as follows. The element T^^ — q^^'^o^ is divisible by 
{q — 1) and 



Wo 



q-1 



Since ^ 



i(uio)+CA _ Mwo) 



9=1 q-l 



= c\, 

9=1 



and z acts by the constant cx the element T^^ must act by the constant q^('^o}+cx_ r^^ie result of 
the Proposition now follows since T^^ = q~'^^^^°^T^^. An alternative way to obtain the constant 
qe{wo)+cx -^yiiich T^^ acts by is to note that 

det(p^(T,)) = (-l)(''--x^(«^))/2^('^^+x^(«0)/2, 
for all 1 < i < n, and this and [Bou, Ch. VI §1 Cor. 2] imply that 

det{T^ ) = q'^^(-^o)dx+2x^{z))/2 ^ qdxieiwo)+cx) _ | 

Combining Propositions 4.9 and 4.8 gives the following bounds on the convergence of the 
systematic scan Metropolis walk on a finite Coxeter group W. Explicit analyses of these bounds 
in examples are given in Sections 5,6,7. 

Theorem 4.10. Let H be the Iwahori-Hcckc algebra corresponding to a finite real reflection group 
W. Let K be the systematic scan Metropolis chain on W , i.e. the reversible Markov chain on W 
with stationary distribution tt determined by left multiplication by the element T^^ of H where 
Wo is the longest element ofW. Then 

(a) \\Ki/7r -l\\l = Yl txdxe^'^'^^°^-'^^\ 

(b) y: A^)\\Ki/^ - = E 

xew A#i 

where £{wo) is the length of wq, t\ are the generic degrees (see (3.15)), d\ the dimensions of the 
irreducible representations of H, and the constants c\ are as given in Proposition 4.9. 



5. The hypercube 

We begin with a simple but instructive example where all details can be carried out. We are 
able to analyze and compare both randomized and systematic scans. We show that both kinds of 
scans take order n log n operations to converge to stationarity. For small values of 9 the systematic 
scan converges faster and for 9 close to 1 the random scan converges faster. 

5a. Preliminaries 
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The Coxeter group W = (Z/2Z)"' has generators si,S2, ■ ■ ■ ,Sn and relations 

= 1, SiSj = SjSi, for all 1 < i,j < n. 

The set X = W = (Z/2Z)"' is the space of binary n-tuples, Si is the vector with 1 in the ith 
coordinate and elsewhere, and the length function is given by £{x) = \x\ = (# of ones in x). The 
longest clement of W is Wq = S1S2 ■ ■ ■ •Sn and i{wo) = n. 

The irreducible representations of the hvahori-Hecke algebra of (Z/2Z)" are all one- 
dimensional and are indexed by n-tuples A = (Ai, . . . , A„), Aj G {0, 1}. Let |A| = Ai + • • • + A„. 
Then 

pA(r,) = |^^^ CA = n-2|A|, and = q^'\ (5.1) 

where cx and tx are the constants defined in Proposition 4.9 and (3.15), respectively. 
Fix < ^ < 1 and let 

is a normalizing constant. Then tt is a product measure on (Z/2Z)" since 



5b. Random scan Metropolis 

The random scan Metropolis algorithm proceeds by choosing a coordinate at random and 
attempting to change to its opposite mod 2. If this results in a one, the change is made. If the 
change in a zero, flip a coin with parameter 9. If the flip comes up heads change the chosen 
coordinate to and if it comes up tails then the coordinate stays at 1. The resulting chain is 

(1/n), ife{y)=£{x) + l, 

il/n)e, if £{y) = l{x) - 1, 

{i{x)/n){l-e), iiy = x, 

L 0, otherwise. 



K{x,y) 



(5.3) 



The following theorem shows that order n log n steps are necessary and suflacient to reach station- 
arity. 

Theorem 5.4. Let the random scan Metropolis algorithm on (Z/2Z)" be defined by (5.3) with 
< 6 < 1. Then, for any starting state x, and any I, 



K'Jt, - 1||2 = 5^ ^2A.x-|A| (^1 - ^(1 + 0)^ 



2£ 



A#0 

For < 6* < 1 and ^ = n(logn - log6' + c)/2(l + d) with c> 0, 



\Ki-n\\l^<{e^-'-l)+e- 



c/2 



(5.5) 



(5.6) 
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The bound in (5.6) is sharp in the sense that if £ = n(logn — logO + c)/2(l + 0) then for all e > 
there exists a c < such that \\Kq — 7r||^^ > 1 — e for all sufRciently large n. 

Proof. Prom the definitions of the irreducible representations of H, 

Xl (t.-. ((^) (Ti + • • • + f„))2^r.) = n-2^g-2^(-)-2^x^ ^ + • • • + T^f'xl {T^f 

= n- V ((n - \X\)q - \\\f' g2(|x|-A.x)^-2|.| 
= (l-(|A|/n)(l + ^)f^02A.x_ 

The first statement then follows from Theorem 4.10(a) with the value for t\ given in (5.1). For the 
second statement we need to bound the sum on the right hand side of (5.5). Since 6* < 1, 6^ '' < 6*° 
and 

II^^A - HI? sEQ"-' (1-^(1 + ")) ■ 

Break the sum at n/2. For the first half use (") < /j\ and 1 — x < to give an upper bound 

n/2 . n/2 _■ 

For the second half change j to n — k and use the same inequalities to get an upper bound 

n/2 f, n/2 

E'^nk-n -{n-k){l+e)2£/n _ ST^ _|_ fc(log n+log 6'+2£(l + 6»)/n) -n log 0-2^n(l+6l)/n 

k=0 k=0 

Using Yl^=o W — ^ for ^ > 2 the bound for the second half becomes 

g(n/2)(logn+log6'+2^(l+e)/n)-nlog6'-2€n(l+e)/n _ g(n/2)(logn-log 0-2^(l+e)/n) _ ^-nc/2 

To show that this upper bound is sharp we use the second moment method. With respect to 
the action of K on L'^{'k) defined in Lemma 4.7 the matrix K(x,y) of (5.3) has an orthonormal 
basis of eigenfunctions 

fjy) = 0-W/^(-0)^-y^ with eigenvalues 1-^(1 + 0), A e (Z/2Z)". (5.7) 

n 

Let Ci € (Z/2Z)" be the vector with 1 in the z*^ entry and elsewhere. We shall use the test 
function, 

T{y) = Y: Uiy) = e-^'^ iZi-er = e-''\n - m + e)) = ^ U- ^^^) • (5.8) 

The expectation and the variance of T with respect to the distribution tt are 

E^{T) = and Var^(r) = E^(r^) = n. (5.9) 
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For i ^ j, 



Q 

fetfcj = fei+ej, and /g. = /o + —j=-fe 



where the second identity is verified by checking that both sides agree when evaluated at each of 
the two cases: y such that j/j = 1 and y such that yi = 0. We can compute the expectation and 
variance of T under the distribution Kq as follows: 

£;,,o(r) = Y^K\^,y)T{y) = Y,{K'Um = ^ (^1 - ^ ) , and (5.10) 



j=i 



Var,,o(T) = £;,,o(n-£;^,o(T)' 

\i=i i^j J ^ ^ 

_^^ n{l-e) / i + eV ^ n{n-l) 2(1 + 9) ^ ( ^ 1 + 



n J 6 \ n J 6 \ n 

We want to use these formulas to show that £ = {n/2{l + 0))(logn — log^ + c) steps are sharp. 
Fixing k and using log(l — x) = —x — + 0{x^) and = 1 — x^/2 + 0{x^), 

when n is large. Thus, for £ = {n/2{l + 9))(logn — \og9 + c) and n large, 
Ee,Q{T) ~ ^/ne-'=/^ and 



. ' ' 'V 2n^ J e n \ 2n2 

= n + Oc,e{Vn) + Oc,0{logn), 

with the error terms depending on c and 9. By first choosing c to be a fixed (large) negative 
number and then letting n — > oo, we see that, if b is large, the set Ah = {x \ |T(a;)| < b^yn} has 
probability 1 — 1/6^ under vr and probability 0(l/fe^) under Kq. This completes the proof of the 
last statement. | 



5c. Systematic scan Metropolis 

We turn next to the systematic scan version. Order n log n steps are required here too. Lest 
the reader think this contradicts the example which begins this paper, we note that the opening 
example (which actually corresponds to the heat bath updating setup) replaces each coordinate 
with a freshly chosen pick. Thus a zero coordinate which is chosen can remain zero with probability 
9. For the Metropolis version analyzed here a chosen zero must change to a one. 
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With notation as in Section 5a, let A'' be the chain on Z/2Z with matrix l ^ ' 
(Z/2Z)'^ define Ki acting as on the ith coordinate. Let 

K = KiK2-KnK,,-Ki. (5.11) 

This is the systematic scan Metropohs algorithm with stationary distribution tt. The following 
theorem gives bounds on the distance to stationarity. 

Theorem 5.12. Let the systematic scan Metropolis algorithm on (Z/2Z)" be defined by (5.11) 
with < 6 < 1. Then, for any starting state x and any i 

\\Ki/7r-l\\l = (5.13) 

For,0 <e <1 and£= \ ( l^^^^l^f + 1 I ^^'^^ c > 0, 

4 V log(l/^) J 



Tiie bound in (5.14) is sharp in the sense that if £ = - { - — -. — 7-— + 1 ) then for all e > there 

4Vlog(l/^) ' 

exists a c < such that \\Kq — 7r||^y > 1 — e for all sufficiently large n. 
Proof. From the definitions of the ireducible representations of 

The first statement then follows from Proposition 4.8(a). Since < 1, B"^^^ < 9^, and 
\\Ki/7r - 1||2 = (1 + e'^'-Y - 1 = (1 + e(4^-i)i°g^)" - 1 



\ (e-^ - 1) . (5.14) 



( («-l)logS\" logr.+ («-l)loge 

< ( e 1 - 1 = e - 1, 



and the upper bound follows. 

The proof of the lower bound is similar to the random scan case and we only give the variants 
needed. As in (5.7) the eigenvectors of the Markov chain are 

j^{y) = Q-\>^\l'^{-Qf-v^ but with eigenvalues Q^^\^\ \e(Z/2'LY. 

Use the same test statistic T as in (5.8). The expectation and the variance of T with respect to 
the distribution vr are the same as in (5.9) and a calculation similar to that in (5.10) yields that 
the expectation and variance of T under the distribution Mg are 

Ee,o{T) = ^y\ and Var,,o(T) = n f 1 + ^ " ^-^^ ^ 



For i 



41og(l/e) 
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E,,o(r) = and Var,,o(r) = n[l + ^^^'"'^ ' ^e"^ ) = n(l + Oe,e(l/V^)). 



This allows the argument to conclude as in the proof of Theorem 5.4. | 



After £ passes the systematic scan algorithm makes 2ln basic steps. Thus the results of 
Theorems 5.4 and 5.12 show that both scanning strategies converge in order nlogn basic steps. 
The following table compares the lead term constants for the two scanning strategies at various 
values of 6. 

e 



general 
1 



1 + e 
1 

2 



random 


systematic 


nlog(n/^) 


nlogn 


2(1 + 0) 


21og(l/0) 


nlogn 


nlogn 


4 


21og(l + e) 


n log 2n 


nlogn 


3 


2 log 2 


nlog(n/e) 


nlogn 


2 


21og(l/e) 



We see that the lead term constants make the random scan faster as 9 ^ 1 while the systematic 
scan is faster as ^ 0. 



6. The dihedral group. 

The hypercube of Section 5 is commutative. This section treats the simplest noncommutative 

example - the dihedral group D2n- We completely analyze the convergence of both the randomized 
and systematic scans. We find that both scanning strategies take order n operations to converge 
to stationarity. 



6a. Preliminaries 



The dihedral group of order 2n is the group W given by generators Si, S2 and relations 



s\ = 1, and siS2S\ ■ ■ ■ = S2S1S2 



n factors n factors 



This is the group of symmetries of a regular 2n-gon where Si and S2 act by reflection in axes 
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through the center of the 2n-gon which form an angle of 27r/2n. 




Fix < 9 < 1 and consider the distribution on W given by 

This measure is largest at the longest element of W, wq = S1S2S1 • • • (n factors), and £{wo) = n. 
The walks to be analyzed will all start at the identity. 

One may picture the walks described in this section on the 2n chambers of an n-gon. Pick 
one chamber (labeled with identity) and identify the two internal sides with si, S2- Reflecting the 
fundamental chamber around gives each edge and each chamber a label. The distance £{w) is the 
smallest number of chambers required to walk from the chamber labeled by w to the identity. For 
example, in -D12, pictured above ^(siS2SiS2) = 4. 

The random scan Metropolis walk proceeds from w by choosing one of Si, S2 with probability 
1/2. lil{s iw) > £{w) the move is accepted. If i[siw) < i{w) the move is accepted with probability 
8 and rejected with probability 1 — 9. 

One pass of the systematic scan Metropolis algorithm chooses n generators in the order 

si,S2,si,S2, Geometrically, starting from the identity, this amounts to marching around the 

n-gon. If no rejections are made one complete scan ends in wq. 

Our bounds result in explicit expressions for the convergence of the two walks. One of these 
has been carefully analyzed by Belsey [Be, Ch. VI Thm. 2-10]. He showed 

Proposition 6.1. For the random scan Metropolis algorithm starting from the identity 



77- log B 

For < 6 < 1 the right hand side of (6.2) is small for k of order 



21og(l-(l/2)(l-V^)2)- 
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Belsley [Be, Ch. VI Thm. 4-12] further shows that the random scan Metropohs algorithm has a 
total variation cutoff at 

i = ^ + cVn- 
For the systematic scan algorithm our results show 

^ \ 9 — 19 — 1 J 

and so, when £ = 1, 

ll^j/._,lj<r(^.^-i).H^. (6.3, 

Thus, for large n and fixed < < 1, a single scan suffices to achieve randomness. 

A comparison of the results in (6.2) and (6.3) shows a mild advantage for systematic scans. 
The effect is most pronounced as 9 approaches 1. 

6b. Representation theory 

References for the statements in this paragraph arc [KSo] and [CR, §67C]. The two dimensional 
irreducible representations of the Iwahori-Hecke algebra H are indexed byO<A<n/2 and are 
given explicitly by 

/(Tr) = .^/^(-5 ^'_t;5), and /(T.) = ,V2^« l + (6.4) 

where 

^^g27riA/n^ and the cquatious a + d = q^^^ - q'^^^ and - aC^ + = q^^^ - q'^^^ 
determine a and d. For these representations 

e,.o and + (6.5, 

n [q-l) {q-Oil-^ ) 

If n is odd there are two one-dimensional representations of H, the "trivial" representation and 
the "sign" representation, 

p^(Ti) = p^{T2) = q, with Ci = n, and ii = 1, 

p'^''{Ti) = p'^''{T2) = -l, with Csg„ = -n, and isg„ = 

If n is even there are two additional one dimensional representations of H, 

P'^iTi)=q, . p-{T^) = -l, 

4-. X and , ^ 

p+(T2) = -l, p-{T2)=q, 

with 

c+ = c_ = 0, and i+ = i_ — 



n(g2 - 1) • 
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The next Proposition computes the traces on the Iwahori-Hecke algebra which are needed in 
order to use Proposition 4.8 to give upper bounds for the convergence of the random and systematic 
scan Metropohs walks on the dihedral group. 

Proposition 6.6. Let W be the dihedral group of order 2n and let wq be the longest element of 
W. Let e = q-^ and = g-^^^'^T^ as in Theorem 4.3. If is the character of a two dimensional 
representation of the Iwahori-Hecke algebra H then 

X',{{f^,r)=2e^'-, and 
For the one dimensional irreducible characters of H, 

xl (((i/2)(fi + f2f^) =1, xT{i(y^)(Ti + f2)f')=e'', 

and, if n is even, 

xt{{f^y)=X-{{f^,r)=0''-, and 

xt [{{i/2){n + f2)r)=x- [{{l/2){f, + f,)r) = 



Proof. The formulas for ((T^^)^^) follow from Proposition 4.9. The results for the one di- 
mensional representations are easy consequences of the definitions of the representations p^. Let 
us compute x^((Ti + T2)"^) for the two dimensional representations. Prom the definition of the 
representations in 6.4 

TV (/(Ti+r2))= 2^1/2(^1/2 _ ^-1/2) 

and 

det(/(ri + T2)) = q{ia- d^id - ar') + (1 + r')(l + ad)il + 0) 
= q {{a + d.){-ar' - - (1 + T + 0) 

where ^ = e^'^*'^/". Thus the characteristic polynomial of p^{Ti +T2) is 

= {t- gV2((,l/2 _ ^-1/2) + (^1/2 + ^-1/2)) j _ ^l/2((^l/2 _ ^-1/2) _ (^1/2 + ^-1/2))^ _ 

This determines the eigenvalues of p^{Ti +T2) and the results follow after substitution of g = 
and fi = q-^Ti. | 

The next two theorems give explicit expressions for the norms \\K{/'k — Ijl^ for the random 
and systematic scan Metropolis walks on the dihedral group. Bounds on rates of convergence based 
on these expressions appear in the summary of results in Section 6a above. 
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Theorem 6.7. Let K be the systematic scan Metropolis algorithm on the dihedral group defined 
by multiplication by the element T^^ = T1T2T1 • • • (2n factors) in the Iwahori-Hecke algebra. Then 

(a) \\K(/7T -lf = ^(^^-1)" + ^(^^-i)'^ ( ■ -A- and 

\ — 1 u — 1 J 

(b) TT{x)\\Ki/TT - = e"^"" + {2n - 2)^2^". 
Proof. From (3.15) we have that 

E^AdA = P^{q) = {l + q) (j^) = (^""(^ + 1) (yfr) • 
From Theorem 4.10(a) 

ll^lVvr - 1||^ = E tM^'^'^-^^-'^^ = txd^O^'A - tM^'- + 4gn4gn(e^'" - e^'-) 

A#l \ A / 

Similarly, use the fact that Ylix^w ~ obtain 

xeW A#l \ A / 

= 02^"2n - ^2^" + 0^^" - ^2^" = + (2n - 2)^^^". | 



Theorem 6.8. Let K be the random scan Metropolis algorithm on the dihedral group defined by 
multiplication by the element (1/2) (Ti + T2) in the Iwahori-Hecke algebra. Then 



l — n / q2 1 \ / on 1 



n \ e -1 \ e -1 



op 

2 - 2cos(27rA/n) f9 + 2cos{7rX/n)9^/^ - V 



^ oSn ~ ^ cos(27rA/n)^ + 1) 
and 

xeW 0<A<n ^ ^ 



Proof. Substituting q = ^ in the formulas for tx gives t. 



sgn 



2,i_„/e"-l\ , 0i-"(^ + l) 2- 2cos(27rA/n) /^-l" 

' ' and tx- \ I J I 



n 



e^-lj n {6^ -2cos{27TX/n)e + l) \ e-1 
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for < A < n/2. Thus, ii K = (l/2)(f'i + fa), Theorem 4.10a gives 



A#l ^ ^ ^ ^ 0<A<n/2 

^ ^ ^ ^ 0<A<n/2 

where the first term appears only if n is even. The result now follows from Proposition 6.6. 
Similarly, 

2i 



+ 2cos(7rA/n)0i/2 - T 



32^ 



0<A<n 



0<A<n/2 

2£ 



7. The symmetric group 

This section proves Theorem 1.4 and a similar result for a different scanning strategy. The 
results show that both scanning strategies require in? operations up to lead term constants. 



Preliminaries 

The symmetric group Sn is generated by the simple transpositions Sj 
and the longest element of Sn is the reversal permutation 



(i,i + l), 1 < ^ < n— 1, 



1 2 

n n—1 



n — 1 n 
2 1 



with ^(ii^o) 



The book of Fulton [Fu] provides a review of the representation theory of Sn and we will adopt the 
conventions for tableaux used there. The irreducible representations of the Iwahori-Hecke algebra 
H are indexed by partitions A with n boxes. 



Number the rows and columns of A as for matrices. If A^ and A^- denote the length of the i row 
and j*^ column of A respectively, the content and the hook length of a box h in position of A 
are 

= j — i and hj, = Xi — i + X'j — j + 1, 
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respectively. Set 

= - l)Ai, and let [k\q = ^—^ and W,! = ^^[/c - 1], • • • [2],[1]„ 

for each positive integer A;. With these notations the dimensions d\ of the irreducible representa- 
tions of if, the generic degrees t\ defined in (3.15), and the constants c\ defined in Proposition 
4.9, are given by 



dA = ^, cx = Y.c{h) and tA = g"(^^.p^, (7.1) 

11 hex i\[hh\q 

bex hex 
respectively (see [Fu, §7.2 Prop. 2], [Mac, I §7 Ex. 7 and §1 Ex. 3], [Hf, 3.4.14] and [Mac, IV 
(6.7)]). The dimension dx is also equal to the number of standard tableaux of shape A, i.e. fillings 
of the boxes of A with 1, 2, . . . , n such that the rows are increasing left to right and the columns 
are increasing top to bottom (see [Fu, p.53]). 

The next lemma provides bounds on the constants in (7.1) which will be useful for proving 
bounds on the convergence times of the systematic scan Metropolis walks that we analyze here. 
The bounds on cx given in part (c) are essentially those given by Diaconis and Shahshahani, see 
[D, 3D Lemma 2]. 

Lemma 7.2. For each partition A let tx, cx and dx be as defined in (7.1). 

(a) Whene = l/q andO <9 <l, tx < oi^^')-^) dx, 

(b) ^ d\ = n!, and < -r^, for each 1 < j <n. 

Ahn ^ 

nV4-n, ifAi<n/2. 
Proof. Set e = l/q and use [Mac, I §1 Ex. 2] and [Mac, III §6 Ex. 2], to get 

tx = g-»(A)-(S)-"+(E^O ["J^' = gn(V)-(^) ^ ^-(5) y-^r(Q) 

6eA bex 

where the sum is over all standard tableaux Q of shape A and r{Q) is the sum of i such that i + 1 is 
to the right of ? in Q. Thus tx is a sum of dx monomials where the lowest degree term has degree 
n(A') - (^) > (^2^) - (^). Part (a) follows. 

We can bound the number of standard tableaux Q of shape A with Ai = — j by noting 
that there are (") ways of picking the elements not in the first row of Q and at most \fj\ ways of 
arranging these to complete a standard tableau. Thus 

2 

n\ r-\ n^^ . n^-' 



(c) Cx < 



Xi=n-j 



The inequalities in (c) are direct consequences of 

Cx < C(Ai,n-Ai), if A > n/2, and cx < C(„/2,„/2), if Ai < n/2. 

Long systematic scan 
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As in Section 4a we fix < ^ < 1 and consider the Markov chain 

!1, if y = SiX and t{y) > £{x), 
9, if y = SiX and ily) < £{x), (7.3) 
1-9, iiy = x. 

which is produced by applying the MetropoUs construction to the base chain 

'^^ \0, otherwise, 

with the distribution tt as given in (1.1). Recall that the chain Ki can be interpreted as follows: 
From w, try to multiply by s^. If this increases the number of inversions of w, carry out 
the multiplication. If it decreases the the number of inversion flip a ^-coin and carry out 
the multiplication if the coin comes up heads. Otherwise stay at w. 

The long systematic scan Metropolis chain is the chain given by 

K = {K,K2 ■ ■ ■ Kr^K^ ■ ■ ■ K2K,) ■ ■ ■ {K,K2K2K,){K,K,). 

The following theorem boTinds the rate of convergence of this Markov chain. It shows that a single 
scan suffices to be close to stationarity. 

Proposition 7.4. Let K he the long systematic scan Metropolis walk on the symmetric group 
Sn defined by (4.2). Let d\, t\ and c\ be the constants given in (7.1) Then, for < < 1, 

(a) \\KUn-l\\l= Y: t,d,e'<i"^)-^>^) , 

A#(n) 

and, with £ = 1, 

\\Kl - ttWI^ < (e"'^"'' - 1) +n!r'/W^ 
which, when 9 < 1, approaches as n ^ 00. 

(b) E-(-)II^^A-i|l^= E dle^<^"^)-^'^- 

xeW A^(n) 

and, with £ = 1, 

Y: 7r{x)\\Ki/n-l\\l^ < (e"^^" - 1) +n!rV^+", 
xew 

which, when 6 < 1, approaches as n ^ 00. 

Proof. By Theorem 4.3 this walk is equivalent to the walk on the Iwahori-Hecke algebra H defined 
by multiphcation by T^^ with respect to the basis {Tyj \ w G <§„}. Thus the equalities in (a) and 
(b) are consequences of Theorem 4.10. 

Fix £ = 1. If Ai = 77. — J and j < n/2 then the bound on c\ from Lemma 7.2(c) gives 

)-(S)+2(2)-2cA < Qj{n-j/2-l/2)-j{j-3) ^ ^j(n-3i/2+5/2) ^ ^j(n/4+5/2)^ 

and, by using the bounds in Lemma 7.2 (a-b), it follows that 

n/2 n/2 ^■ 

j = l \^(,n) j = l ^' 
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When Ai < n/2, the bound in Lemma 7.2(c) gives 

A 

Al<n/2 

The upper bound on \\Ki — 7r||^^ follows by combining these expressions. The upper bound in (b) 
is proved similarly. | 



Short systematic scan 

We now analyze the convergence of the short systematic scan and prove Theorem 1.4 of the 
introduction. The short systematic scan Metropolis chain on the symmetric group is given by 

K = KiK2---Kr,Kn---K2Ki, 

where Ki is as in (7.3). The Theorem shows that order n short systematic scans are necessary 
and suffice to reach stationarity when starting from the identity. In part (b') it is shown that for 
typical starting values this chain converges in order log n scans. 

Theorem 7.5. Let K be the short systematic scan Metropolis algorithm on the symmetric group 
defined by (4.2). Let d\, t\ and c\ be the constants given in (7.1). Then 

(a) \\K(/Tr-l\\l= iAE^''^""'"'^^^"^^^^^^^e 

A#(n) S 

the sum is over standard tableaux of shape X and S{n) denotes the box of S containing n. 
(a') For e = n/2- (log n/ log 6*) + c with c > 0, 

WK'^ _ 7r||2 < fe^'°+' - iW ^lfl'^V8-n(logn)/(log0)+n(c+l/4) 
II 1 IItv — \^ / 

Conversely, if I < n/4 then, for fixed < 9 < 1, \\Ki — irW^y tends to 1 as n ^ oo. 

(b) Y.7r{x)\\Ki/7r-l\\l= J2 dxY.e^'^^-'-<^^^^^\ where 

xeW S 

the sum is over standard tableaux of shape A and S{n) denotes the box of S containing n. 
(b') ForO<9 <1 and £ = -(logn)/(log6') + c with c> 0, 

J2 <x)\\Ki/Tr-l\\l < (e^" - l) + (^^y e'/''V2^. 



Proof. By Theorem 4.3 the Markov chain K is the random walk on {T^j \ w G Sn} defined by 
multiplication by T^-i • • • T2TIT2 • • • T^-i in the Iwahori-Hcckc algebra H corresponding to the 
symmetric group >S'„. Let H' be the Iwahori-Hecke algebra corresponding to the symmetric group 
Sn-i and let H be the Iwahori-Hecke algebra corresponding to 5„. Let w'q be the longest element 
of Sn-i and let wq be the longest element of 5„. The inclusion Sn-i C Sn induces an inclusion 
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H' Q H o( the corresponding Iwahori Hecke algebras. In H the generators Tj are invertible with 
Tr^ =q-iTi + {l-q-^). Then 

Ti2 r/i — 2 rri rri rT\2irf~\ rri 

■wo-^w'g - J-n-l - ■ ■ J-2J^\J-2 - ■ ■ J-n-l 

and so it follows from Proposition 4.9 that, in the representation of H indexed by the partition 
A, the element 

K = T„_i • • • fzT^fa • • • T„_i has eigenvalues gn-i-c{S{n)) ^ 

where S runs over standard tableaux of shape A and S{n) denotes the box of S containing n. This 
determines the eigenvalues of K'^^ in the representation p'^ and so 

S 

where Xh irreducible character of the Iwahori-Hecke algebra corresponding to the partition 

A. Parts (a) and (b) now follow from Proposition 4.8. 

Using 6)2n"-i-c(s(n))) < ^2£(n-Ai)^ ^^^j bound for tx in Lemma 7.2 gives 

\\Kf/Tr-l\\l< J2 ^('^')"(^)dAXl^'^^""^'^ ^ Yl ^i(--^i)(n-Ai+4m-2n)^2^ 
A#(n) S A#(n) 



since d\ is the number of standard tableaux of shape A. Fix £ = n/2 — (log n) / (log ^) + c. Then, 



using the bound on the sum of d\ from Lemma 7.2, 



ziij(l+4€+l-2n+4(logn/log6»)) °° a(2c+l)j ^ 

j = l A^(n) j = l j = l 

The function (n - Ai)(n - Ai + 4^ + 1 - 2n) has a minimum at n - Ai = (-1/2) (4^ + 1 - 2n). At 
this minimum Ai = n — 2 (log n)/ (log ^) + 2c+ 1/2 > n/2, and so 

^i(n-Ai)(n-Ai+4€+l-2n)^2 ^ (n/2)(n/2+4€+l-2n) ^| ^ ^|^nV8-(" log n)/(log e)+n(c+l/4) _ 

Al <n/2 

Combining these sums establishes the upper bound in (a'). 
To prove the lower bound in (a') let 

A = {weSn\iiw)>2e{n-l)} = !^weSn ^H-Q < 2(n- l)(n/4-^ 

Since each pass of the systematic scan can change the length of a permutation by at most 2(n — 1), 

Ki{A) = 0. (7.6) 

From equation (2.13), 

^ - <^ . E ^ ^ - ^ . E ^ ^ (;■) . o(„). 

j=2 3=2 j=2 ^ ^ 
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and 

Thus Chebychev's inequality implies that, when n is large, 

/ Var.(^(t.)) \ / e \ 

^^""V (2(n-l)(n/4-^))2y ~ V 4(n - 1)(1 - ^)2(n/4 - ^)2y ' ^'"'^ 

If ^ < n/4 then the right hand side approaches 1 as n ^ oo. Thus (7.5) and (7.6) imply that 
\\Kf — 7r||^^ ^ 1 as n — >■ oo and this proves the second statement of (a'), 
(b') Using the bound 0^^in-i-c(S{n))) < Q2e{n-\i)^ gj^^g 

Fix £ = — (log ra) / (log ^) + c. Using the bound in Lemma 7.2(a) gives 

i = l A#(n) j = l -^^ j = l J- 

Al =n—j 

and, by using the bound in Lemma 7.2(b) and the bound on n! given in [Fe, (9.15)], 

Al <n/2 

The result follows by combining the bounds for these two sums. | 



i-Kn. 



References 



[Ami] Y. Amit, Convergence properties of the Gibbs sampler for perturbations of Gaussians, Ann. 
Statistics 24 (1995), 122-140. 

[Am2] Y. Amit, On rates of convergence of stochastic relaxation for Gaussian and non-Gaussian 
distributions, J. Multiv. Anal. 38 (1991), 82-99. 

[AG] Y. Amit and U. Grenander, Comparing sweep strategies for stochastic relaxation, J. Mul- 
tiv. Anal. 37 (1991), 197-222. 

[AK] S. Ariki and K. Koike, A Hecke algebra of (Z/rZ) I Sn and construction of its irreducible 
representations. Adv. Math. 106 (1994), 216-243. 

[Be] E. Belsley, Rates of convergence of random walk on distance regular graphs, Probab. Theory 
Related Fields 112 (1998), 493-533. 

[Bc2] E. Belsley, Rates of convergence of Markov chains related to association schemes, Ph.D. 
thesis, Harvard, 1993. 



34 PERSI DIACONIS AND ARUN RAM 

[BF] P. Barone and a. Frigessi, Improving stochastic relaxation for Gaussian random fields^ 
Probab. Eng. Inform. Sci. 4 (1990), 369-389. 

[BHV] L. BiLLERA, S. Holmes, K. Vogtman, Geometry of the space of phylogenetic trees, Technical 
Report, Department of Statistics, Stanford University, 1999. 

[BS] E. Brieskorn and K. Saito, Artin-Gruppen und Coxeter-Gruppen, Invent. Math. 17 (1972), 

245-271. 

[Bou] N. BouRBAKi, Croupes et algebres de Lie, Ch. 4,5 ct 6, Masson, Paris, 1981. 
[Bw] K. Brown, Buildings, Springer-Verlag, New York-BerUn, 1989. 

[Ca] R. Carter, Finite groups of Lie type - Conjugacy classes and complex characters, Wiley- 
Interscience, John Wiley & Sons, Inc., New York, 1985. 

[Cr] D. Critchlow, Metric methods for analyzing partially ranked data. Lecture Notes in Statis- 
tics 34, Springer-Verlag, Berlin, 1985. 

[CR] C. Curtis and I. Reiner, Methods of representation theory - with applications to Gnite 
groups and orders, Volumes I and II, Wiley-Interscience, John Wiley & Sons, Inc., New York, 
1981 and 1987. 

[De] P. Deligne, Les immeubles des groupes de tresses generalises, Invent. Math. 17 (1972), 273- 
302. 

[D] P. Diaconis, Group representations in probability and statistics. Institute of Mathematical 
Statistics, Hayward, CA, 1988. 

[DH] P. Diaconis and P. Hanlon, Eigen- analysis for some examples of the Metropolis algorithm, 
in Hypergeometric functions on domains of positivity, Jack polynomials, and applications 
(Tampa, FL, 1991), 99-117, Contemp. Math., 138, Amer. Math. Soc, Providence, RI, 1992. 

[Det] P. Diaconis, S. Holmes, S. Janson, S. Lalley, R. Pemantle, Metrics on compositions 
and coincidences among renewal sequences, in Random discrete structures, D. Aldous and R. 
Pemantle eds., Springer, New York, 1996. 

[DS] P. Diaconis and L. Saloff-Coste, What do we know about the Metropolis algorithm?, J. 
Computer and System Sciences 57 (1998), 20-36. 

[Fe] W. Feller, An introduction to probability theory and its applications, J. Wiley and Sons, 
New York, 1968. 

[Fi] G. Fishman, Coordinate selection rules for Gibbs sampling, Ann. Appl. Probab. 6 (1996) 

444-465. 

[FV] M. FiGNER AND J. Verducci, Probability models and statistical analyses for ranking data. 
Springer Lecture Notes in Statistics 80, Springer, New York, 1993. 

[Fu] W. Fulton, young tableaux, With applications to representation theory and geometry, Lon- 
don Math. Soc. Student Texts 35, Cambridge Univ. Press, Cambridge, 1997. 

[GS] J. Goodman and A. Sokal, Multigrid Monte-Carlo method. Conceptual foundations, Phys. 
Rev. D 40 (1989), 2035-2071. 

[HH] J. Hammersley and D. Handscomb, Monte Carlo Methods, Chapman and Hall, London, 
1964. 



Metropolis scans and Hecke algebras 



35 



[Hf] P.N. HOEFSMIT, Representations of Hecke algebras of finite groups with BN-pairs of classical 
type, Ph.D. Thesis, University of British Columbia, 1974. 

[Hu] J. Humphreys, Reflection groups and Coxeter groups, Cambridge University Press, 1990. 

[KS] J. Kemeny and L. Snell, Finite Markov chains. The University Series in Undergraduate 
Mathematics, Van Nostrand, Princeton, 1960. 

[KSo] R. KiLMOYER and L. Solomon, On the theorem of Feit-Higman, J. Combinatorial Theory 
A 15 (1973), 310-322. 

[Lei] G. Letac, Problemes classiques de probabilite sur un couple de Gelfand, in Lect. Notes in 
Math. 861, Springer- Verlag, New York, 1981. 

[Le2] G. Letac, Les fonctions spheriques d'un couple de Gelfand symetrique et les chaines de 
Markov, Advances in Appl. Prob. 14 (1982), 272-294. 

[Ma] C.L. Mallows, Non-null ranking models I, Biometrika 44 (1957), 114-130. 

[Mac] I.G. Macdonald, Symmetric functions and Hall polynomials. Second edition, Oxford Math- 
ematical Monographs, Oxford University Press, New York, 1995. 

[Mar] J. Marden, Analyzing and modeling rank data, Monographs on Statistics and Applied Prob- 
ability 64, Chapman &; Hall, London, 1995. 

[MR] N. Metropolis, A. Rosenbluth, M. Rosenbluth, A. Teller and E. Teller, Equa- 
tions of state calculations by fast computing machines, J. Chem. Phys. 21 (1953), 1087-1092. 

[Pa] I. Pak, When and how n choose k, in Randomization methods in algorithm design (Princeton, 
NJ, 1997), 191-238, DIMACS Ser. Discrete Math. Theoret. Comput. Sci. 43, Amer. Math. 
Soc, Providence, RI, 1999. 

[Ra] A. Ram, Seminormal representations of Weyl groups and Iwahori-Hecke algebras, Proc. Lon- 
don Math. Soc. (3) 75 (1997), 99-133. 

[RS] G. Roberts and S. Sahk, Updating schemes, correlation schemes, correlation structure, 
blocking and parameterization for the Gibb sampler, J. Royal Statist. Soc. 59 (1997), 291-318. 

[RX] K. Ross AND D. Xu, Hypergroup deformations and Markov chains, J. Theor. Probability 7 
(1994), 813-830. 

[SB] W. Shannon and D. Banks, Combining classification trees using MLE, Statistics in Medicine 
18 (1999), 727-740. 

[SC] L. Saloff-Coste, Lectures on probability theory and statistics (Saint-Flour, 1996), 301-413, 
Lecture Notes in Math. 1665, Springer, Berlin, 1997. 

[Sc] C. SCHOOLFIELD, Random walks on wreath products of groups and Markov chains on related 
homogeneous spaces, Ph.D. Thesis, Dept. of Appl. Math., Johns Hopkins Univ., 1998. 

[Si] J. Silver, Weighted Poincare and exhaustive approximation techniques for scaled Metropolis- 
Hastings and spectral total variation convergence bounds in infinite commutable Markov chain 
theory, Ph.D. Thesis, Dept. of Math., Harvard University, 1996. 



